Document Clustering for Social Problem Detection and Cluster Evaluation Measures

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Association Coefficient Measures for Document Clustering

This paper presents Association Coefficient Measures for Document Clustering. The proposed Association Coefficient Measures approach is based on Intuitionistic Fuzzy Sets. In this paper twelve Association Coefficient Measures from f1 to f12 are used. In Document Clustering Document collection, Text Pre-processing, Feature Selection, Indexing, Clustering Process and Results Analysis steps are us...

متن کامل

Candidate Cluster Extraction for Hierarchical Document Clustering

Text Document are tremendously increasing in the internet, the hierarchical document clustering has proven to be useful in grouping similar document for large applications. Still most documents suffer from problems of high dimensionality, scalability, accuracy and meaningful cluster labels. In this paper an new approach fuzzy frequent itemsets based hierarchical clustering is proposed, in which...

متن کامل

Similarity Measures for Text Document Clustering

Clustering is a useful technique that organizes a large quantity of unordered text documents into a small number of meaningful and coherent clusters, thereby providing a basis for intuitive and informative navigation and browsing mechanisms. Partitional clustering algorithms have been recognized to be more suitable as opposed to the hierarchical clustering schemes for processing large datasets....

متن کامل

Optimum Cluster Labeling and Document Clustering for Forensic Analysis

Document clustering or unsupervised document classification is an automated process of grouping documents with similar content. Document clustering is an important task in many Information Retrieval systems. Also document clustering Algorithms can help in discovery of new and useful knowledge or novel class from the documents under analysis. This knowledge or novel class is very important issue...

متن کامل

Challenging Issues and Similarity Measures for Web Document Clustering

Web itself contains a large amount of documents available in electronic form. The available documents are in various forms and the information in them is not in organized form. The lack of organization of materials in the WWW motivates people to automatically manage the huge amount of information. Textmining refers generally to the process of extracting interesting and non-trivial information a...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Japanese Society for Artificial Intelligence

سال: 2009

ISSN: 1346-0714,1346-8030

DOI: 10.1527/tjsai.24.333